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Abstract 

The analysis of the linearization effect in multifractal analysis, and hence of the estimation of moments for multifractal processes, is 
revisited borrowing concepts from the statistical physics of disordered systems, notably from the analysis of the so-called Random 
Energy Model. Considering a standard multifractal process (compound Poisson motion), chosen as a simple representative example, 
we show: i) the existence of a critical order q* beyond which moments, though finite, cannot be estimated through empirical 
averages, irrespective of the sample size of the observation; ii) that multifractal exponents necessarily behave linearly in q, for 
q > q*. Tayloring the analysis conducted for the Random Energy Model to that of Compound Poisson motion, we provide 
explicative and quantitative predictions for the values of q* and for the slope controlling the linear behavior of the multifractal 
exponents. These quantities are shown to be related only to the definition of the multifractal process and not to depend on the 
sample size of the observation. Monte-Carlo simulations, conducted over a large number of large sample size realizations of 
compound Poisson motion, comfort and extend these analyses. 

Keywords: Multifractal analysis, linearization effect, compound Poisson motion, Random Energy Model, truncated moments, 
moment dominant contributions. 
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1. Introduction 

\ Multifractal analysis is now considered as a canonical tool 
to study scaling properties and re gula rity fluctuations in time 
series (or n-dimensional fields) l33l [la 13611 . Practically, 
it essentially amounts to computing time or space averages 
of (the q-th power of) time and scale-dependent quantities 
T(a,t), leading to the so-called structure functions, S„(a,q) = 
\T{a, tk)\ q - The multiresolution quantities T(a, t) are 
computed directly from the data, and depend both on the time 
(or space) location and on the analysis scale a. Typical exam- 
ples of such quan tities T(a, f) are the increments X(t + a) - X(t) 
jof a signal X Il3l[l4ll . the wavelet coefficients or the wavelet 
Leaders ll36ll . In practice, multifractal analysis assumes that the 
structure functions behave as power laws with respect to the 
analysis scale a, in a range a,„ < a < au, with aulotm 1> 



X 



S„(a,q) a S (q)a 



(l) 



and to estimating the corresponding scaling exponent, £(q). The 
exponent £(q) is a concave function of the statistical order q. 

It has been observed and argued that the exponent ((q) nec- 
essarily behaves as a linear function of q beyond some value 
-see 11281 12711 for the original reports of the phenomenon, |31 ] 
for its analysis in the case of Mandelbrot multiplicative cas- 
cades, and fl8l Q1 (and respectively) for more recent signal 
processing (and statistical analysis, respectively) oriented con- 
tributions, in framework of the multifractal analysis of sample 
paths of stochastic processes. Following 11811 . this is referred to 
as the linearization effect in multifractal analysis, and its study 



constitutes the core of this contribution, where it is intended to 
take advantage of a formal analogy between the linearization 
effect in multifractal processes and the glass transition in the 
Random Energy Model (REM) Hill to interpret the lineariza- 
tion effect as a phase (or glass) transition. 

The REM consists in a simple model classically used in sta- 
tistical physics as an illustration of a mean-field scenario for 
the glass transition in spin-glasses 111 ill or supercooled liquids 
. In this model, all microscopic configurations have ran- 
dom independent energies drawn from the same distribu- 
tion. These energies are quenched, i.e., they do not evolve with 
time. The interest of the REM stems from the fact that it dis- 
plays a glass transition at finite temperature, and that this transi- 
tion can easily be studied analytically lUlh . The physics under- 
lying the glass transition in the REM is rather simple. Above the 
glass transition temperature, thermal activation is efficient and 
a large number of microscopic configurations are explored: the 
system is in a 'liquid' state. Below the glass transition temper- 
ature, thermal activation no longer plays a significant role, and 
the system is frozen in the few lowest energy configurations. As 
a result, its entropy per degree of freedom vanishes. The def- 
inition and main properties of the REM are briefly recalled in 
|Appendix A| The analogy between statistical physics models 
and multifractal analysis has been continuously and fruitfully 
used after the seminal contribution of Parisi and Frisch lfl4ll and 
the developments reported in J^. The linearization effect has 
been studied in the light of statistical physics models such as 
those of progressive waves lfl9l l9tl. which can be regarded as 
alternative to the REM. This is notably the case in models such 



Preprint submitted to Physica D 



April 25, 2011 



as mm. 

The rationale underlying the comparison between multifrac- 
tal analysis and REM lies in two key facts: Both the REM 
and multifractal analysis involve the evaluation of sums of ran- 
dom variables raised to a given power (constituting the con- 
trol parameter of the problem); In both cases, these random 
variables have heavy-tailed distributions though all their mo- 
ments are finite, a typical example being the lognormal dis- 
tribution. In multifractal analysis, the structure functions are 
defined as functions of the statistical order q, in the limit of 
a large number of terms. In the REM, the partition function, 
Z = Yik exp(— is defined as a function of the inverse tem- 
perature B, assuming that the number of microstates is large. 
Hence the quantities \T(a, tk)\ and exp(— £*) formally play a 
similar role, and the partition function Z is the formal analog of 
the structure function S n (a,q). Quite importantly, heavy-tailed 
distributions have the property that the dominant terms in the 
sum become very large (especially for large values of q or B in 
the present context), which turns the use of the Central Limit 
Theorem and of the Law of Large Numbers into a delicate mat- 
ter (see, e.g., [0] in the context of the REM). 

The present contribution aims at exploring the extent to 
which the statistical physics arguments, involved into the study 
of the REM to explain the zero-entropy phase transition, en- 
able to understand the linearization effect in multifractal analy- 
sis. This contribution thus further complements and enriches 
the connections between multifractal analysis and statistical 
physics 03> 0. El B ■ More precisely, given the observation 
of a finite number « of samples, taken from a single realization 
of a multifractal process, the goal of this paper is to analyze, 
using statistical physics techniques, the critical statistical order 
q* up to which the empirical average S„(a, q) allows for a cor- 
rect estimate of the ensemble average (\T(a, t)\ q ). The very ex- 
ample of multifractal processes consists in the celebrated Man- 
delbrot multiplicative cascades (see, e.g., for re- 
views). However, in the present work, use will be made of 
Compound Poisson Cascades, recently introduced in (0] (see 
also lldl "). because they benefit of statistical properties that are 
easier to handle practically and theoretically: their increments 
are stationary and characterized by a continuous scale invari- 
ance property, i.e., Eq. (Q]i above holds for a continuous range 
of scales a e [a m ,anf]- 

2. Compound Poisson cascades 

2.1. Definition of the processes 

Compound Poisson cascade (CPC) and compound Poisson 
motion (CPM) were recently introduced by Barral and Mandel- 
brot 0] and are now considered as reference multifractal pro- 
cesses. The CPC Qr(t) corresponds to a product of positive, 
independent and identically distributed random variables W*, 
referred to as multipliers, and associated to randomly located 
points (ft, r,) on a rectangle 

I r .L = joV') : r < r' < 1, -i < t' <L+ H. (2) 



More precisely, the CPC is defined for r > through 

Q r (i) = B r (t) W U 0) 

&,r ; )eC r (0 

where only multipliers associated with points belonging to the 
cone 

C r (t) = l(fS) : r <r'< 1, /- ^ <t'<t+ (4) 

are taken into account. The points (f,, r,-) are drawn from a Pois- 
son process with intensity measure dm{r, t) on the rectangle 
I n L- The parameter B,(t) is a normalizing constant such that 
(Qr(t)) = L where (...) denotes the ensemble average (or ex- 
pectation) of the process. 

It has been shown that CPC satisfy the following key relation: 

(Q r (ty<) = exp [-(p(q) m(C r (t))] , q > -1 (5) 

where m{C,{t)) — L dm(r', f) corresponds to the measure of 
the cone C r (t), and where (p(q) is defined as 

<p(q) = (1 - (W q )) - q(l - (W». (6) 

For the sake of simplicity, we only consider the case of smooth 
concave functions (p(q), a typical example of which being the 
lognormal case ip(q) = cq(l - q), with a constant c > 0. The 
CPM, X(t), is obtained by integrating the CPC, Q r (t), overtime, 
and by taking the limit r — > 0: 

X(/)=lim f Q r (s)ds. (7) 

r - >0 Jo 

This definition yields a well-defined process on condition that 
?(l-)>-l §]. 

2.2. Scaling and multifractal properties 

The increments T(a, f) of the CPM X(t), defined as 

T(a, t) = X(t + a) - X(t), (8) 

with a > 0, are positive, due to the positivity of Q r (t) (cf. 
Eq. (0). If the intensity measure of the Poisson process has 
the factorized form dm(r, f) — g(r)drdt, the increments T(a, t) 
correspond to a stationary random process Jioll . meaning that 
all the statistical properties of T(a, t) do not depend on time t. 
Interestingly, it has also been shown that the moments <(J{a, t) q ) 
are finite only for -1 < q < q c , where q c is given by |g]: 

q c = sup [q > 1 : q + ip(q) - 1 > 0). (9) 

One then expects that the probability P(T(a, t) > x) behaves 
asymptotically as P(T(a, t) > x) ~ x~ cic when x — > +oo, and 
hence that the variables T(a, t) are heavy-tailed. 

In addition, when g(r)dr = c(dr/r 2 + (5(i)(c/r)) (as proposed 
in |0]), where 6{\){dr) denotes a point mass at r — 1, the infi- 
nite divisibility underlying the construction of X(t) implies the 
following scaling properties 

(T(a, t) q ) = C q a Mq \ (10) 



for -1 < q < q c , with A(q) - q + ip(q) filial- Note that ([TOj is 
valid for all a in the interval < a < L. 

The multifractal spectrum D{h) consists of the Hausdorff di- 
mension of the set of points t on the real-line that possess the 
same singularity (or Holder) exponent h: 



T(a, t) ca 



Mf) 



0. 



(11) 



The function D(h) hence provides a global description of the 
local fluctuations of a sample path of X(t). For a thorough in- 
troduction to multifractal analysis, the reader is referred to e.g., 

US. 

From the results obtained in |01, it can be inferred that the 
multifractal spectrum D{h) of the CPM can be derived from the 
concave Legendre transform of A(q), 



A*(h) = M{qh - A(q)}, 

3 



and can be expressed as: 



DQO 



1 + A*(h), 



if 1 + A*(h) > 0, 
otherwise. 



(12) 



(13) 



Also, it is interesting to quantify the dependence structure of 
T(a, t). The two-time correlation function of T(a, f) has been 
shown to take the following form 13511 : 

(T(a, t)T(a, t+s)) = o 2 (|s + a\ A(1) + \s- a\ A{1) - 2\s\ A(1) ) , (14) 

where a 2 is the constant 

1 



o- 2 = 



A(2)(A(2)-\y 



(15) 



This two-time correlation function can be recast into the fol- 
lowing form 

(T(a,t)T(a,t+s)) = a A(2) f(^\ (16) 

with 

f(u) = a 2 (\u + l\^ + \u-l\^-2\u\^). (17) 

Eq. (TToT l shows that the variables T(a, t) are correlated over a 
typical time scale a. This result will prove useful in Sec. 13.41 
Let us however emphasize that the correlation time a appearing 
in the two-time correlation function of T(a, t) is induced by the 
"measurement" process itself, that is, the fact that T(a, t) cor- 
responds to the increment of the signal X(t) on a scale a. The 
original signal X{t) is scale invariant, and thus has no character- 
istic time scale. 

2.3. Large deviation properties 

The statistics of the increments T(a, t) has been characterized 
by the moments (T{a, t) q ), given in Eq. (fTUl i. It is also interest- 
ing to characterize this statistics through the probability density 
of T(a, t). For reasons that will appear clearer later, it is conve- 
nient to consider the random variable h a {t) defined as 



h a (t) 



In T(a, t) 
In a 



(18) 



Note that, from Eq. (fTTT i above, h a (t) corresponds, in the limit of 
fine scale a — > 0, to the Holder exponent h{t). The probability 
density function of h a (t), for a given f, is denoted as p a (h). It 
does not depend on time t due to the stationarity of the process 
T(a,t). 

We wish to show that p a (h) obeys a large deviation form in 
the limit a — > 0, namely 



Pa(h) 



|lna|(Hft) 



(19) 



A common way to derive a large deviation form is the Gartner- 
Ellis theorem 1341 Il5l 1 1211 - which also allows the explicit ex- 
pression of ifr(li) to be determined. We first define 

M^) = lim-\ln(^ ln * w ). (20) 
a^o | lna| N ' 

The function p(q) can be computed from Eq. ( fTOb . yielding 

fi(q) = -A(-q), -q c <q<\. (21) 

From the properties of A(q), it can be inferred that p(q) is a 
smooth convex function. Assuming the existence of the large 
deviation function tpili) introduced in Eq. ( fT9] l, Q the Gartner- 
Ellis theorem leads to the following expression 



if/(h) = sup{qh - fi(q)}. 

q 



(22) 



The existence of the limit moment q c implies that the previous 
equation is valid for h > h c such that 



h c = /J.'(-q c ) = A'(q c ). 

Using Eq. (1221 and the property A{q c ) - 
Eq. (0, h c can be characterized by 

i/f(h c ) = 1 - q c h c , 



(23) 

1 resulting from 
(24) 



a property that we mention for later use. 

Note that t//(h) is the convex Legendre transform of fi(q), 
which is more common than the concave Legendre transform 
appearing in Eq. ( TT2b . Using Eq. d2Tl i, the two Legendre trans- 
forms can be related in the following way: 



i//(h) = sup{qh + A(—q)} 

i 

= supj-g'/i + A(q')} 

q' 

= - suplq'h - A(q')}, 

q' 



(25) 
(26) 
(27) 



with q' — -q, leading to t//(li) = -A* (h), or equivalently, t//(h) 
I - D(h), as long as D(li) > 0. Finally, we note that the large 
deviation behaviour of p a (h) for a — > can be rewritten as 



Pa(h) ~ a 



■MID 



(28) 



If pi(q) was finite for all real q, the Gartner-Ellis theorem would imply the 
existence of i//. In the present case, where /i is finite only for — q c < q < 1 , we 
can strictly speaking only conjecture that if/ exists. 
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which closely matches the so-called thermodynamical multi- 
fractal formalism used for practical multifractal analysis, and 



13] 



relying on the heuristic assumption p a {h) ~ a' 

We can also observe that for h < h c , A* develops a linear 
branch 

A*(h)=hq c -l. (29) 

This expression may differ from the rate function if/, but at 
least provides the convex hull of if/, which is consistent with 
the infinite nature of the moments of T for q > q c : Indeed, if 

p a (h) * a-' u i'\ then P(T(a, t) > x) ~ x~«°. 

3. Critical order for empirical moment estimation 

We now assume that a single observation of the process 
X{t) is available, via a finite number of sampled times with 
a sampling period 6t. From this observation, n coefficients 
\T{a,tk),k = 1, ...,n] are computed, with f# = (k - l)5t (to 
simplify the presentation, we assume that n is independent of a, 
though this would not be strictly true in practice). The structure 
function can be rewritten as: 



S„(a,q) = - V T(a,t k ) q = - 1 e ^ h ^\ 



(30) 



In this section, we introduce a critical order q*, up to which 
the time average S„(a,q) estimates correctly the ensemble av- 
erage (T(a, t) q ), and we study how q* behaves as n — > +oo. 
The reasoning relies on combining an estimate of the number 
of independent coefficients, with two arguments inspired from 
the analysis of the REM (see J2l and Appendix A i, namely the 
identification of a dominant contribution from a saddle-point 
estimation of the g-th moment, and a truncation effect due to 
finite sample size observations. 

3.1. Dominant moment contribution 

The expression Eq. (TTOb of the moments of T(a, t) can be 
easily recovered from the large deviation form Eq. ( TT9b : 



(T(a, t) q ) = {a qKW ) 



X+co 
e -\\na\[qh + m-\ dh _ ( 31 ) 
DO 



In the limit a — > 0, a saddle-point evaluation shows that the 
dominant contribution to this integral is located at h = h m given 
by 

^(Am) = ~q, (32) 
so that the moment (T(a, t) q ) reads 

(T(a, f) q ) * e -Hn«l[9'' m +^ m )]_ ( 33) 
For h > h c , ifi(h) = -A*(h) and h m satisfies: 

(A*)'(h m ) = q. (34) 
From the properties of the Legendre transform, it also implies: 
qh m + A*(h m ) = A(q) (35) 

and 

h m = A'(q), (36) 
which implicitly defines h m as a function of q. 



3.2. Finite sample size 

In the set {h a (tk), k = 1, . . . , «}, the largest individual contri- 
bution to S„(a,q) comes, when a — > 0, from the lowest value 
of h a . To quantify the order of magnitude of the typical lowest 
available value of this set, a simple idea is to consider a thresh- 
old h' a {n) such that 



P(h a (t l ),...,h a (t n )>hl) = e- T , 



(37) 



where r > is an arbitrary constant. The notation 
P{h a (f\), . . . ,h a (t n ) > hi) denotes the probability that all the 
random variables h a {t\), . . . , h a {t n ) are larger than the value h' a . 
If the random variables \h a {tk)} were independent, Eq. < f37T > 
would, for large «, simplify to: 



P(h a < hi) 



(38) 



For the CPC, the variables {h a (tk)} are strongly dependent. We 
can however postulate that there exists an effective number n a < 
n of independent samples. We can then define hi by analogy to 
Eq. ([38}, leading to 



or equivalently, 



In 



P(h a <hl)*—, 
n„ 



(^) = -lnP(h a <hl). 



(39) 



(40) 



Let us now determine h a more explicitly as a function of n a . 
Using the large deviation form Eq. (fT9b in Eq. (1401) . ones gets: 



l n ^) = _ln J h "e-\^Wdh 



(41) 



Because if/ is a decreasing function of h on this interval, a 
saddle-point argument amounts to evaluating the integral as the 
integrand boundary value: 

ln(^)^|lnfl|^). (42) 
The threshold hi is thus determined from the implicit equation 

*«>- iE5 k (T)- <43) 

Note that the arbitrary choice of t is fading away in the limit 
a 0. 

3.3. Truncated moments and structure function 

Having introduced the threshold hi, truncated moments can 
be defined as: 

a qh p a {h) dh. (44) 

hi 

Let us emphasize that the truncated moment in principle de- 
pends on the specific choice made for the threshold hi. This 



4 



slight dependence however has no consequence on the conclu- 
sions drawn from the truncated moments M(a, q), as seen be- 
low. 

These truncated moments provide us with a relevant evalu- 
ation of (the log of) the expectation of the random variables 
constituted by the structure functions. More precisely, if we 
analyse scale by scale the signal X(t) by considering a sequence 
of scale ak = 2~ k L with w# = 2 k then 



lnS„ t (a k ,q) R . s ]nM(a,q) 
lim = hm . 

k^+oo | In | a-»o | In a] 



(45) 



A proof of this result, which mainly relies on the Borel- 
Cantelli lemma under a realistic assumption, is provided in 
Appendix B In a standard framework of i.i.d. random vari- 
ables, this limit would be far too rough to provide any useful 
insight about S„(a,q). In our case, however the limit retains 
some fundamental information about the behaviour of S „ . 

3.4. Critical order 

Combining the truncation and saddle-point arguments, we 
observe that two different situations can arise (cf. 12J]). 

When h m (q) > hl,(n a ), the truncated moment M(a, q), and 
hence the structure function S n (a, q), correctly accounts for the 
ensemble average (T(a, t) g ), which can thus be evaluated as (us- 
ing a saddle-point evaluation in the limit a — » 0): 



lim 

a-»0 



In M(a, q) 
llnal 



= -(qhm(q) + i/f(h m (q)). 



(46) 



In contrast, when h m (q) < h' a (n a ), the dominant contribution 
to the truncated moment is no longer located at h m (q) but in- 
stead comes from the lower bound h a {n a ) of the integration in- 
terval, in which case In M(a, q) reads (again from a saddle-point 
evaluation when a — > 0): 



lim 



In M(a, q) 
final 



-(qh ] a (n a ) + i//(hl(n a )). 



(47) 



This reveals that, for small enough a, S„(a,q) undergoes a 
"phase transition" when varying q, occurring at a critical order 
q* a , defined as 

h m (q* a ) = hl(n a ). (48) 

Interestingly, Eq. ( l47l i reveals a linear behaviour in q of 
In S„(a,q) when q > q*, thus accounting for the linearization 
effect reported in Q [mI QH [fl] . Note also that Eq. (gH) can 
alternatively be interpreted as the minimal number of indepen- 
dent samples n'*{q) needed to correctly estimate the moment of 
order q. 

We now investigate the asymptotic behaviour of q* in the 
limit n a — > oo (or a — > 0). In practice, this limit is obtained 
by successively considering smaller and smaller resolutions St. 
We have seen in Eq. ( TToT l that the variables T(a, t) are correlated 
over a time a, so that the same result is expected for h a {t). A 
natural estimate for n„ is thus 



L 

n a = -, 



(49) 



where L is the total length of the signal. 

We first observe that Eqs. (1431 and (|49l implies 

wto -» i 



(50) 



when a — > 0. Hence, h a {n) converges in the limit a — > to a 
finite value hi, independent of n, and uniquely determined by 
4i(h' Q ) = 1 which, in the multifractal settings, can be rewritten 

as: 



D(AJ) = 0. 



(51) 



This result is particularly interesting from the point of view of 
multifractal analysis, and its interpretation will be further dis- 
cussed in Section [33] 

Eqs. (l35T l and (l36b implicitly rely on the assumption ftj > h c , 
that we now briefly discuss. Using Eq. (l24l i and the fact that 
^ is a decreasing function from h c to h a , one can see that the 
condition h Q > h c is equivalent to 

1 < 1 - h c q c . 

Hence the property A'(q c ) = h c < implies ht > h c , a condi- 
tion which is thus true in all interesting cases, confirming the 
validity of Eqs. (f35T > and (l36l l. Combining these equations with 
Eqs. d42b and d48l i yields the relation: 



ln(» a /r) 
In a 

Using n a = L/a, we find 



(52) 



In the limit a 
order q* as: 



ln(L/r) 
In a 

-> 0, Eq. 



- 1 = q* a A'(q* a ) - A(q* a ). (53) 
defines a finite asymptotic critical 

(54) 



0=1+ q*A'{q*) - A{q*). 



The comparison of Eq. (fSTb and Eq. d54l i moreover immedi- 
ately shows that: 

h\ - A'(q*). (55) 

In summary, assimilating S n (a, q) with M(a, q) and combin- 
ing Eqs. d46b and ( f47b , we find that the empirical structure func- 
tion S„(a,q) typically behaves as a power law with respect to 
the analysis scale a when a — > 0, namely S n (a, q) ~ So(q) a^ q \ 
with £e(<?) an empirical scaling exponent. More formally, we 
can define the scaling exponent ( e (q) as the random variable: 



U4) 



lim 



In S„(a,q) 
In a 



(56) 



Eq. d45l l then implies that £ e (q) is almost surely equal to its 
average value (£ e (q)} = ((q): 

Uq) = m- (57) 

The value of ((q) can be expressed, using Eqs. d46b and ( l47b . as 



A(q), 

1 + qA'iq"), 



-1 < q < q* 
q > q*. 



(58) 



From a practical viewpoint, the above results can be summa- 
rized as follows in terms of the structure function S n {a,q). If 
q < q\ 

In S„(a,q) « A(q) In a, (59) 

while if q > q*, 



\nS n {a,q) « (1 + qA'{q*)) Ina. 



(60) 



3.5. Comments on the critical order 

To sum up, Eqs. (|54]i, (|55]l, d58]l, (|59j and d60]l constitute the 
most important results of the present contribution, that call for 
a number of comments. 

i) For multifractal processes such as CPM, the time averages 
(or structure functions) S„(a,q) do not converge at large n to 
the ensemble average (\T(a, t)\ q ), for q > q*. 

ii) It is important to note that q* + q c . Using Eqs. (f54-b and 
(0, it can easily be shown that q* < q c . Therefore, the critical 
order up to which S n {a,q) accurately estimates the ensemble 
average is not related to the finiteness of the moments of \T(a, f)| 
but occurs for much lower values of q. 

iii) The critical order q* and the critical Holder exponent hi 
are found to be independent of the actual number « of avail- 
able samples. Therefore, increasing « (through a decrease of 
the sampling period 5t) does not allow for a significantly better 
result. Moreover, Eq. (l53l l shows that in practice the effective 
critical order at scale a only weakly varies with n or a. Note that 
for given specific models A(q), it is possible that the solution of 
Eq. (l54t is q* = +oo, which can either be understood as the fact 
that the linearization effect does not occur for such cases or (our 
preferred interpretation) that the linearization effect is a general 
effect that is rejected at infinity for those particular cases. 

iv) The above properties, which appear as consequences of 
Eq. ( BTT l. can be interpreted as follows, in a way closely par- 
alleling the arguments of the REM (see Appendix Ai. In a 



given sample, the number of independent points having a sin- 
gularity exponent h scales as n a e^ (h ^ Xaa \ for a — > 0. Using 
n a = L/a, the above number thus scales as e^ 1 '^"' ^. This 
means that in a given sample, there will be a large number of 
points with singularity h when 1 - if/(h) > (corresponding to 
D(h) > 0), while there will be no such points in a typical sam- 
ple when 1 - iff(li) < 0, irrespective of its observation duration 
L and of the analysis scale a. The value hi, such D(h^) = (cf. 
Eq. (IBTTl). therefore receives a simple interpretation within this 
framework. The analogy with the REM can even be pushed 
further. Given the correspondance between, on one-hand, the 
partition function Z of the REM and the structure function S „ 
in the multifractal case, and, on other-hand, the inverse tem- 
perature /3 and the order q of moments, we find that the analog 
of the entropy per degree of freedom (which is zero in the low 
temperature phase of the REM /3 > /3„ = T~ ) is the quantity 
q('{q) - ((q) + 1, which is indeed equal to zero in the linear 
regime obtained for q > q*. Table [Tj sketches the correspon- 
dence between the quantities defined in the REM and in multi- 



fractal analysis (see Appendix A for the definition of the nota- 
tions used in the REM). 

v) It is worth mentioning that the interpretations of the analy- 
ses reported above in terms of h compared to h' Q and entropy had 



MF 


REM 


S (a a\ x — Y"" T(a t, }4 


ZI2 N - -±- y 2 " e~ pE ' 


In n a ~ — In a 


1« « AT 1« 

in n — 7V in z 


1 


P — 1 


In T(a, t kj ) 


-Ej 


hjtk,) 


ej/ln2 


-q~ l In (n a S„(a,q)) 


F = -rinZ 


qC(q) - ((q) + 1 


S/(N ln2) 


q* 


Pg 


K 


e"'71n2 


h,„ 


e,„/ln2 



Table 1: MF vs. REM. Mapping between quantities denned in the multifractal 
analysis (MF) and in the Random Energy Model (REM), valid in the limit of 
small a and large N. In order to interpret S„(a, q) as a sum of (almost) indepen- 
dent variables, a sequence of n a times [tit,,] = 1, ■ ■ ■ , n a \ is extracted from the 
full set {t k ,k = 1, . . . ,n}. 



already been envisaged by B. Mandelbrot in a series of seminal 
articles dedicated to detailed practical aspects of multifractal 
analysis, the most prominent of them being e.g., 12 ll 12211 . 

vi) The theoretical analysis of the linearization effect ob- 
tained in the present contribution from REM-type statistical 
physics arguments is similar to (and hence fully conforts) the 
conjecture formulated in flil flll. stemming from the interpreta- 
tion in terms of extreme values and local regularity of empirical 
observations obtained from the application of the multifractal 
formalism to numerical simulations of CPM and other related 
multifractal processes. 

vii) Complementary theoretical analysis (as in Section [3]) 
and numerical analysis (as in Section [4] below) conducted for 
multifractal processes other than CPM (not reported here) sug- 
gest that the results obtained here for CPM are valid for much 
broader classes of multifractal processes (cf., e.g., 11811 ). 



4. Monte-Carlo simulations 

In the study of systems such as the REM, or, more gener- 
ally, in the frozen phase of spin glasses, the condensation of 
explored configurations onto a small subset is classically mea- 
sured using a theoretical or numerical tool referred to as the 
participation ratio lEME^ . In this section, we make use of this 
tool to further analyze the linearization effect in the context of 
multifractal analysis. 

The definition of the participation ratio p(a, q, p) is taylored 
from classical formulations in statistical physics to the context 
of multifractal processes, with n a = L/a: 



p(a, q, p) = 



(61) 



In the analysis of the REM, it can be shown that, in the glassy 
phase p > f3g (associated here to q > q*), the participation 
ratio is non-self-averaging, which means that it depends on the 
explicit observation (or sample) of the process X(t), and hence 



of its increments T(a, t), even in the limit a — » 0. Therefore, in 
that limit, its expectation satisfies the following explicit closed- 
form formula, for all p > 1 and q > 0, 125l 12611 : 



lim a _ <p(a, q,p)) = 



np-q'lq) 

T(p)n\-q'lq) 



if q < q* , 
if q > q* 



(62) 



The expected behavior recalled in Eq. d62b is now tested nu- 
merically, in the context of CPM, by means of Monte-Carlo 
simulations. The p{a,q,p), as defined in Eq. (loTt above, 
are computed over 500 independent realizations of CPM of 
length 2 22 , for q = 1, ...,15, p = 2,4,5 and a = 2 J , with 
j = 1, . . . , 18. The ensemble average (p(a,q,p)) is estimated 
by the average (p(a,q, p)) M c of the p(a,q,p) over the inde- 
pendent realizations. The expected (p(a,q,p)), according to 
Eq. d62l , and (p(a, q, p))mc are compared in Fig.Q] For q <§: q* ', 
(p(a,q, p))mc - (p(a,q,p)) = 0, for all p > 1 and a > 0. 
For q » g*, (p(a,q, p))mc departs unambiguously from 0, for 
all p > 1 and a > 0, and moreover follows a dependence in 
q and p, that globally matches that of (p{a,q,p)), expected 
from Eq. d62l ). The transition from zero to non-zero values of 
(p(a, q, p))mc occurs for values of q typically around q*, as the- 
oretically computed from Eq. (154-b . 

The match between the expected (p{a,q,p)), according to 
Eq. d62l and (p(a,q, p))mc is not perfect though. This may 
stem from a number of causes. On the one hand, the [E^, k = 
1, ...,«} in the REM and the {In T(a, t k ),k=l,..., n a } in CPM, 
though both heavy-tailed might have not exactly the same dis- 
tributions. On the other hand, the derivation of the theoretical 
results in Eq. ( f62b relies on an exact independence assumption 
of the [Ek, k — 1, . . . , «}, while the {In T(a, k),k — 1, . . . , n a ) still 
remain significantly correlated, as predicted by Eq. ( fT4b . which 
may affect the limiting ensemble average. Note that results are 
shown for the arbitrary scale a — 4 only, as all conclusions 
drawn above are identical at all scales. 

These empirical observations are regarded as satisfactory re- 
sults, corroborating numerically the theoretical analysis of the 
linearization effect observed in multifractal analysis and con- 
ducted from REM-type arguments. 

5. Discussion and conclusion 

In this contribution, it has been shown that the time averages 
(or structure functions) of the (q—th power of the) increments 
of the sample path of Compound Poisson Motion, chosen as 
a simple representative of multifractal processes, cease to ac- 
count correctly for the (ensemble average) moments of order q, 
above a critical order q*. This critical order is entirely defined 
from quantities entering the definition of the process and does 
not depend on the sample size of the observation: Increasing 
this sample size (by decreasing the sampling period 6t) does 
not permit to increase the range of orders q for which moments 
can be correctly estimated. This critical order is not related ei- 
ther to the lack of finiteness of the moments. Moreover, for 
q > q*, the structure function still exhibits power-law behaviors 
with respect to scale a, with scaling exponents that however be- 
have linearly in q. Both the critical order q* and the slope of the 



linear behavior are predicted quantitatively. These predictions 
are obtained from the tailoring of statistical physics arguments 
involved in the analysis of the REM to multifractal processes. 
The reason why increasing the sample size does not permit a 
correct computation of the moments for q > q* can be under- 
stood as the non-self-averageness property in the glass phase 
of the REM. This correspondence is reminiscent of the analogy 
that leads from the thermodynamical formalism to the multi- 
fractal formalism, commonly used to measure the multifractal 
spectrum from empirical data Jl4l \^ . 

This contribution can hence be read as a further effort to 
make explicit the fruitful correspondences between the ther- 
modynamical and multifractal formalisms, in the spirit of e.g., 
Il4ll32i [3l [l3ll . with a specific emphasis on marrying in a single 
point of view different perspectives on the linearization effect: 
that of stochastic process sample path based statistical estima- 
tion, that of statistical physics and that of local regularity func- 
tional analysis. 

Monte-Carlo simulations, based on the numerical synthesis 
of independent sample paths of CPM and estimation of the par- 
ticipation ratio, a classical tool in the statistical physics of con- 
densed matter, satisfactorily confirm these predictions. These 
predictions based on REM-type statistical physics arguments 
are in perfect consistence with those proposed in jl8i llll. based 
on an extreme value analysis of the multifractal formalism and 
provides a complementary understanding of why time averages 
do not converge to ensemble averages. 

The analysis conducted here can be, mutatis mutandis, ap- 
plied straightforwardly to other multifractal processes. This is 
notably the case for fractional Brownian motion in multifractal 
time 12411 . which is obtained by subordinating CPM to a clas- 
sical fractional Brownian motion and which constitutes a very 
appealing model to account for the multifractal properties of 
real data. Monte-Carlo simulations, not reported here, such as 
those described in Section|4] performed on fractional Brownian 
motion in multifractal time, yield conclusions in perfect consis- 
tency with those drawn from the analysis of CPM. 

In addition, the analysis conducted here can also naturally 
be extended to multiresolution quantities other than the incre- 
ments. We performed Monte-Carlo simulations on CPM and 
fractional Brownian motion in multifractal time (not shown 
here) using increments of order P (i.e., increments of incre- 
ments of increments. . . ) as well as wavelet coefficients (com- 
puted from mother wavelets with different number of vanishing 
moments, see e.g., l20ll ). These simulations also lead to com- 
parable conclusions. 

Furthermore, this work opens the track for a systematic defi- 
nition and estimation of a critical order for the moment estima- 
tion, in different contexts where the variables of interest consist 
of random exponentials, as in 0]. This, together with the prac- 
tical estimation of the critical order from a finite sample size 
observation, is under current investigation J2]. 
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Appendix A. Random Energy Model 

A very simple disordered model, which nevertheless captures 
a lot of the phenomenology of realistic disordered systems, has 
been proposed by Derrida, and called Random Energy Model 
(REM) liU]. It can be thought of as a spin model, although 
spins do not play any essential role in the description. Consid- 
ering a system of N spins, the corresponding number of con- 
figurations is n — 2 N . To each configuration j is associated a 
random energy Ej drawn at random from a distribution P(E): 



P(E) 



1 



va^T 2 



exp 



E?_ 

'nj 2 



(A.l) 



The energies Ej are independent and identically distributed ran- 
dom variables. We denote as p(E)dE the number of configura- 
tions with energy in the interval [E, E + dE], so that p(E) is the 
density of configurations with energy E. The density p(E) is a 
random quantity, but its fluctuations are small if p(E) is large, 
namely p(E) as (p(E)). By definition, (p(E)) = nP(E), leading 
to 

II 
NJ 2 t 

F 1 

In n 1 1 - 



(P(E)} 



exp IN In 2 



= exp 



J 2 In 2 



(A.2) 



where the energy density s = E/N has been introduced. One 
sees that if 1 - e 2 /(/ 2 ln2) > 0, corresponding to |e| < e' = 
/ Vln 2, (p(E)} is exponentially large with N, so that there is 
a large number of configurations at energy density s, and the 
assumption p(E) as (p(E)) is justified. In contrast, if |e| > e', 
{p(E)) is extremely small for large n. This means that in most 
samples, there are no configurations at energy density |e| > e' . 
The non-zero, but small value of (p(E)) comes from the contri- 
bution to the average value of very rare samples, which include 
some configurations with exceptionally low (or high) energy. 

We can now evaluate the partition function of the REM, de- 
fined as 

Z = ^e- EklT . (A.3) 
j=i 

This partition function is a random variable, the typical value 
of which can be evaluated as follows: 



In the large n limit, we can evaluate Z typ through a saddle-point 
calculation, namely 



-typ 



,-(lnn) j mi „(£ T ) 



(A.7) 



where g m i n (e ) is the minimum value of g(s) over the interval 
[-e 1 , e']. Let us first consider the value e m which minimizes 
g{s) over the entire real line. Taking the derivative of g(s), one 
has 

2s 1 

Sis) = TTT-^ + (A.8) 



From g'(s) = 0, we find 



/ 2 ln2 rin2' 
J 2 

hn- 



(A.9) 



As g(s) is a parabola, it decreases for s < s m and increases for 
s > s m . If s m > -e + , then g min (e t ) = g(e m ), so that 



-typ 



-Ng(s m ) 



(A. 10) 



The condition e m > -e' translates into T > T g , where the so- 
called glass transition temperature T g is defined as 



T, = 



./ 



2 Vln 2 



(A. 11) 



For e m < -e', or equivalently T < T s , g(s) is an increasing 
function of e over the entire interval [-«' , s'], so that gmi n (e' ) = 
g(-e'), and 



-typ 



(A. 12) 



From these estimates of Z typ , one can compute the free energy 
F = -rinZ ty p, and the entropy S = -dF/dT. For T > T„, one 
finds 

F = -AMrin2 + — |, 



4T 



leading for the entropy to 



S =N ln2- 



4T 2 



For T < T g ,we have 



F = TNg(-e ] ) = -NJ Vln 2. 



(A.13) 



(A. 14) 



(A.15) 



The free energy does not depend on temperature in this range, 
so that the corresponding entropy vanishes: 



5=0, 



T < T s . 



(A. 16) 



It can also be checked that the entropy given in Eq. dA.14t for 



de(p(s)} e~ Ne l T , (A. 4) T > r s vanishes continuously for T — > T s . Hence the temper- 

s' ntnrA T nnrrpcnnnHc tr\ a alucc trnncitinn tpmnpraturp \\iYif*re 



with the notation p(s) = Np(Ns). In the above equation, we 
have replaced p(s) by (p(s)) for |e| < e"'", and by for |e| > e T . 
We can then write 



with 



Z typ = J dse-V nn)g(s) 



s e 
g(e) = -r-^ + r - - 1. 



7 2 ln2 Tln2 



(A.5) 



(A.6) 



ature T g corresponds to a glass transition temperature, where 
the entropy goes to zero when lowering temperature, and re- 
mains zero below T s . Actually, to make the statement sharper, 
only the entropy density S /N goes to zero for T < T g , in the 
infinite limit. Computing subleading corrections to the en- 
tropy S , one finds that S is independent of Af, but non-zero, for 
T <T g . The entropy is then intensive in this temperature range, 
meaning that only a finite number of configurations, among the 
n — 2 N possible configurations, are effectively occupied: the 
system is quenched in the lowest energy configurations. 



Appendix B. Almost sure convergence of In 5/| In a\ 

The aim of this appendix is to sketch the proof that for n k = 

2 k and a k = 2~ k L: 

\nS nk (a k ,q) a . s \aM(a,q) 
hm — ; — = hm — - — ■ — . (B.l) 

a-^+oo I lna^l a-»o |lna| 

We refine the definition of h' by choosing 

1 



The quantity m p can be bounded using the Borel-Cantelli 
lemma, leading to 



\Ip) a.s a.s 2 

— < m p < (In n k Y {I p ) 



(B.13) 



Tk 



k~ 



(B.2) 



in Eq. ( 1391 . instead of a constant r. This alteration does not 
affect hi due to the property 



The upper bound is a direct consequence of the classical 
Markov's inequality. However the lower bound is more subtle 
because it requires the use of the Chebyschev 's inequality and 
consequently a bound for the correlation of the^/ ; . Let us make 
the realistic assumption that such a bound exists. Moreover the 
vanishing length of the intervals I p implies that : 



dp) 



k_ a Hc P ) 



(B.14) 



lim J^L = 0. 



a-->+c» | In a a- I 
With this choice of r k , we have 



(B.3) 



J] P(3i < n k , h ak (i) < h[) < +c». (B.4) 

k 

The Borel-Cantelli lemma states that if a sequence of events A k 
satisfies £ P(Ak) < +°°, then the event A k only happens for 
a finite number of k. Choosing the events to be A k = (3i < 
n k , h ak (i) < h\), we find that for sufficiently large k, all the h a 
are almost surely larger than h, . Denoting^/ the characteristic 
function of the set /, 



£->+oo \nn k 

Hence, we have E\ e (0, 1) such that for sufficiently large k, 

S„ t (a k ,q) < ^(ln«i.) 2 4(l + e 1 K c " + < / ' (^ > , + m^". (B.15) 
p 

We are mainly interested in the extremal contribution, which 
comes from c m : 

~qc m +tl/(c m ) 

(1 - s 2 )l k - - < S„ k (a k ,q) < (1 + s^lnn^ha"^^ 
2 Ink 

(B.16) 

with si, £3 € (0, 1). These two inequalities can be rewritten in 
logarithmic terms, 



1 ifxel 
otherwise 



(B.5) 



-(qc m + i//(c m )) + 



ln(l - £2) + lnZ* - In 2 - lnlnn^ 



we can write 



S nt (a k ,q) = -ya c ' h -x [hUoo) (hd. (B.6) -(qc m + if,(c m )) 

K i=l 



s In S„ t (a k ,q) a.s 

: = < 

I In a k \ 

3 In In n k + ln( 1 + £3) + In l k 
I lnal 



(B.17) 



In order to refine this result, we partition the interval [h ' ', +00) 
in different sub-intervals. First, we define a separation point Coo 
by 

i//(Coo) = 0. (B.7) 
The corresponding sub-interval is 

loo = (Coo, +00). (B.8) 

The remaining interval [h^, c m ] has a finite length l k 



The total length l k is bounded because so are h, and c ro . When 
k — > +00, the length of the interval tends to 0, which means 
that 

c,„ -> max(/i„„/i'), (B . 1 8 ) 

Thus taking the limit k — > +00 in Eq. (IB. 17b leads to 



lim 

k—. >+°° 



In S„ t (a k ,q) a .s 



I In a k \ 



= -q max(/! ; „, h ' ) - ^(max(/i,„, /z^)). 



l k — Coo h lr 



(B.9) 



We partition this interval into a number [2 In n k ] of sub-intervals 
(where [x] denotes the integer part of x), with 



c p — h ' + p - 



(B.10) 



1 + [ln^] 

I p = [c p ,c p+1 ], \p\ < 1 + [Inn*] (B.ll) 
We call then m p the density of points inside the interval I p 



1 "* 

P = -p Ip (hm. 



(B.12) 



(B.19) 

The right hand side of the previous equation exactly corre- 
sponds to the evaluation of the truncated moment by the saddle- 
point method obtained in Eqs. d46*b and d47l i. which demon- 
strates the validity of Eq. dB.U . A similar result for the dyadic 
multiplicative cascade (cf 13 110 suggests that the almost sure 
convergence holds in the general case even without the assump- 
tion made in order to obtain the lower bound. 
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Figure 1: Participation ratio. Solid black line: (p(a,q,p)}, according to Eq. 
(62} ; dashed red line: (fi(a,q, p))mc averaged over Monte-Carlo simulations, 
with 95% confidence intervals ; red vertical dashed line: position of the critical 
q* as computed from Eq. <54t . Top: p = 2, middle p = 4, bottom p = 5, for 
scale a = 4. 
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